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ABSTRACT 


An analysis of U. S. Navy major aircraft accidents 
during the period Fiscal Year 1972 - 1974 was 
conducted. Forward (stepwise) Multiple Regression 
technigues were employed on a group of ten basic 
variables considered time dependent. The multiple 
regression techniques were employed EO develop 
predictive equations for the dependent variable, 
Accident Rate with a view to determining which of the 
basic variable measures were Significant in accident 
rate studies and if the variables are unigue to a 
specific aircraft community or generally applicable to 


all aircraft. 


Aircraft considered independently were A-4, A-6, 
A-7, and F-4, additionally composites of Attack 
aimemact  (A~a;eAra, A-5, A-G, A-7), Fighter aircraft 
Geemarc PEQ@pelter aircraft (E-1, E-2, C-1, C-2, 
S-2, P-3, C-117, C-118, and C-130) and Helicopters 
(i= 1, Be2,98-3) H-46, and 8-53) were considered. 
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I. INTRODUCTION 


ne > eS eS ee 


The increased sophistication of military aircraft and 
the related increased dollar costs of procuring both 
aircraft and pilects places ever greater emphasis on the 
importance of determining a feasible method of reducing 
losses through aircraft accidents. Many research efforts to 
date have dealt with determining the causal factors 
underlying an aircraft accident, then using these causal 
factors, attempting to develop predictive nodels Loc 


accident occurence. 


Aircraft accidents have been broadly categorized in 
terms of aircraft design malfunctions, aircraft equipment 
failures, pilot or flight personnel error, and weather as 
primary causes for occurrence of mahor aircraft accidents. 
An accident is designated as a major accident if: 1) loss 
of life is involved; 2) complete loss of an aircraft is 
involved; or 3) substantial damage occurs to any aircraft 
involved. Substantial damage is defined in appendix A of 
OPNAVINST 3750.6 (Series). 


The most common cause of aircraft accident cited has 
been pilot error. Brictson, et. al. (1969) studied a four 
year span of aircraft carrier landing accidents involving 
attack and fighter aircraft. Approximately seventy-eight 
percent of the accidents studied had pilot error as the 
primary causal factor. Brictson noted that the majority of 
the accidents were of two types, hard landings and 
undershooting the landing area. The small deck carriers 
accounted for seventy percent of the total accidents even 


though the large deck carriers had more activity. 








Studies conducted for the Royal Air Force by Goorney 
(1965) dealt with the human factors involved in pilot error. 
He determined that pilot fatigue, emotional stress, 
complacency, lack of current flying experience contributed 
to pilot error and, if monitored, could lead to prediction 
of the likelihood of pilot error related accidents. 


There are some analysts who feel that if pilot error is 
a primary cause of accidents then the more proficient pilot 
should make fewer errors. This belief leads to the 
hypothesis that measures of pilot proficiency could be used 
aS predictive measures. Keller (1961) hypothesized that 
Eight time was positively correlated with Prloe 
proficiency. He stated that were a pilot to fly the proper 
amount he would attain a safe proficient ability as a pilot. 
The procedure of how to determine the proper amount of 
flight time necessary to attain proficiency and how the 
number of hours needed would interact with fatigue and 


complacency were not fully explored. 


Collicot, et. al. (1972) compared accident rates of 
Single-seat aircraft with those of dual piloted aircraft. 
They noted that if the operations were about equal the dual 
piloted aircraft had fewer accidents per ten thousand flight 
hours than the single piloted versions. Though the authors 
refer thrcughout their study to pilot proficiency they also 
allude to a possibility of temporary mental overload as a 


critical factor underlying pilot error. 


The determination of pilot error tends therefore to 
expand to include emphasis on temporary mental overload as 
well as pilot proficiency measures. Efforts by Kowalsky, 
et. al. (1974) were made to examine causal factors in high 
pilot error rates. Previous efforts to reduce pilot error 
had concentrated on improving pilot proficiency. Kowalsky 


and his co-researchers used cluster analysis and pattern 





recognition technigues and discovered that the single most 
lmportant causal factor was that, for non-training, 
non-midair accidents, pilots were often temporarily 
overloaded’ and incorrectly evaluated information presented 


during the period of overload. 


Many studies and much effort has been expended in 
accident research. The extensive data base maintained by 
the Naval Safety Center of accident related information 
opens doors for further statistical analysis of accident 
Cates with goals of constructing useful predictive 


mathematical models. 


Myers (1974) hypothesized that measures of pilot 
proficiency and experience available in data collection 
banks would be sufficient to construct a predictive model. 
He used statistical techniques of principle component 
analysis applied to two groups of fifty piiots. One group, 
pilots who had been involved in aircraft accidents, the 
second, pilots with no accidents. The results were not as 
good as was desired, possibly due to the limited sample 


sizes employed. 


A second approach was used by Stucki and Maxwell (1975) 
who used the techniques of regression analysis applied to 
data on cver two thousand aircraft accidents as collected by 
the Naval Safety Center. Their efforts dealt with pilot 
proficiency variables, aircraft variables and type of flight 
information. They then applied regression analysis to the 
composite group of all accident involved aircraft in the 
Navy's inventory. This effort yielded a predictive equation 
composed of four pilot related variables Eo predict 


Varlations in aircraft accident rates. 


Work by Robino (1974) reported fluctuation in aircraft 
by months with the month of March significantly higher. 





Subsequent efforts in this area by Poodk (1976) failed to 
Support the March phenomena and go on to demonstrate that 
fluctuations in aircraft accident rates by month is. purely 


randon. 


The author of this study believes that the premise 
promoted by Myer, Stucki and Maxwell and others is valid. 
There should be sufficient data available on current 
aircraft accidents to conduct detailed statistical analysis 
with the resultant predictive equations both meaningful and 
useful. The variable nature of aircraft accident rates 
Suggest that the underlying factors may be definable and, if 


they can be determined, used in accident prevention. 


If statistical analysis of aircraft accident rates can 
provide information on accident related variables, be they 
pilot-oriented, aircraft oriented or related to some other 
source, which vary either directly or inversely with 
aircraft accident rates then preventative actions can be 
taken to suppress the enormous costs in dollars and human 


life associated with aircraft accidents. 





Monthly accident rates exhibit a marked variability when 
each calendar month is compared to other months. The belief 
that some months are consistently higher that others has 
been noted freguently in studies. This phenomena has’ been 
noted in studies of U. S. Air Force accident rates by Zeller 
and March (1973) and by Robino (1972) in a study of Navy 
aircraft accident rates. Recent work by Poock (1976) at the 
U0. S. Naval Postgraduate School displays no statistical 
basis for any month being consistently high and attributes 
the fluctuations to random effects of the underlying causal 


factors. 


The accident rate is defined as the total number of 
accidents ina given month times ten thousand hours divided 
by the total number of flight hours flown that nonth. 


The efforts of this study, motivated by work of Stucki 
and Maxwell (1975), were to explore accident rate dependence 
on time related variables by specific aircraft types where 
possible and composites of aircraft types where necessary. 
The results desired are a series of predictive equations 
unigue to a specific aircraft or a community. It is 
believed that if in fact aircraft type has no large effect 
on accident rates that the data will yield similar equations 


for each type considered. 
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This chapter contains the data selection procedure, the 
techniques employed in data preparation, a description of 
the analysis procedures and a summary of decision criterion 
employed in selecting the best equation for predicting the 
Variance in the dependent variable rate. 


A. DATA SOURCE 


All Navy and Marine aircraft accidents and incidents are 
reported in detail to the Naval Safety Center, NAS, Norfolk, 
Va. The reporting criteria is detailed in Navy Aircraft 
Accrdent, Incident and Ground Reporting Procedures 
(OPNAVINST 3760.6 (Sseries)). As Naval Safety Center is a 
repository for all data recorded on aircraft accidents they 


are the source of data used in this report. 


B. DATA SELECTION 


As the goal of this study is to apply the concept 
envisioned by Stucki and Maxwell (1975) to individual type 
aircraft where possible and to group type of ai rcraft where 
necessary, the same basic data set as provided Stucki and 


Maxwell by the Naval Safety Center was employed. 


Table 1 lists the data initially reguested from and 
provided by Naval Safety Center. 
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TABLE 1 
DATA SET REQUESTED FROM NAVAL SAFETY CENTER 


Data concerning the pilot: 


Age 

Injuries . . 

Number of previous service tours 

Total flying time in aircraft model in which 
accident occurred. 

Total eens 8 previous ninety days. 

Total nighttime flight hcurs in previous ninety days 
Total daylight carrier landings in previous thirty 


WM NOM FWNe 


ays 
Total hight carrier eae in previous thirty days 
Number of years as designated Naval Aviator 


Data concerning aircraft: 


Model 

Damage ; 

Number of tours between major aircraft rework 
Type of last Bey OF inspection 

Hours since last inspection 
Identification of the system or component failure 


ANE WA 


Data concerning the flight: 


Major command 

HE DOT ANd custodian 
Ships's hull number (1f appl 
Marine Air Wing (if applicab 
Location 

Flight Purpose Code 

eee of operation code _. 
Phase of operation in which the accident occurred 


icable) 
le) 


COnNIANIE Wh — 
ooeoeeeee @ 


Data concerning the accident: 


pee oon identification number including calendar 
ate 

Other aircraft damaged 

EAE C epee se injured 

Contributing causal factors _. 

Special data not otherwise listed 

Weather 
Accident rate for the month in which the accident 
occurred 


SDE WN) on 
eeeee08 @ 


From the available data set ten basic variables were 


selected in cooperation with Naval Safety Center personnel 
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for inclusion in this study (see Table 2). 

In general, multiple regression requires that variables 
are measured on interval or ratio scale and that the 
relationships among the variables are linear and additative. 

The data set provided consisted of information on 
Two-Thousand-One-Hundred-Ten accidents or incidents which 
occurred during Fiscal Years 1969 to 1974, inclusive. 
Selection of a suitable time span was based upon the 
following basic constraint considerations: 

(1) The necessity to have as large a sample size as 
possible to enhance validity of statistical inferences. 

(2) The desire to restrict the years in the sample to 
periods when the aLECEatt inventory was reasonably 
consistent. 

(3) The necessity to consider gaps and inconsistencies 
in desired data points due to changes and/or modifications 
in data collection requirements and recording procedures 
that had occurred within the Accident Reporting Systen. 

To accomplish these major considerations the entire data 
base waS initially included. Then each variable was 
examined throughout the entire data set in terms of how 
changes in the reporting procedure or inconsistencies in the 
data would effect that variable. This treatment resulted in 
the reduction of the data base to the 
FPive-Hundred-Sixty-Nine accidents occurring during the three 
year period FY 72-74 inclusive. 

During this period the inventory of aircraft with which 
this study deals was reasonably constant. 

While this treatment did create a complete data base 
wherein all information desired was available for all 
accidents the resulting size does hamper the investigation 
of dircraft types in cases where the inventory 1s small to 
begin with and/or where there are few accidents as in the 
A-3 community. This leads in some instances to grouping 


data under categories such as Propeller, Attack or Helos. 
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C. VARIABLE SELECTION 


2 e =— 
The ten bkaSic variables selected for inclusion are shown 
in Table 2. 


TABLE 2 
DATA SET INCLUDED IN CURRENT STUDY 


1. Accident rate by month (RATE) 
2. Pilots age (AGE 
3. Total flight time in accident involved aircraft 
model {tt BE). } ; 
4. Total flight time during ninety days preceding 
accident (TOT9Q) 
5. Total night flight time during the preceeding ninety 
hights (NITE9Q) . 
6. Daylight carrier landings during the preceeding 
thirty days (CLDAY) j : 
7. %\Naght carrier landings during the preceeding thirty 
nights (CLNITE) 
8. Number of aircraft tours (ACTOUR) 
9. Aircraft flight hours since last major or calender 
inspection (ACHRS) ; ; 
10. Number of years designated Naval Aviator (DNA) 


In addition to the basic variables the author used an 
eleventh variable, DAY90 = TOT90 -~ NITE9O, which is the 
total daylight flight time in the preceeding ninety days. 

Pilots age and years designated Naval Aviator were 
included as they are variables that are historically used as 
indicators of maturity and perhaps proficiency. If, as the 
author believes, the hypothesis that the older pilots’ tend 
to be safer pilots through a finer sense of judgement of 
risks involved is a valid hypothesis, the author would 
expect a negative simple correlation between AGE and DNA 
With rate. However because the number of other confounding 
factors is great a negative correlation would not justify 
the acceptance of the hypothesis. 

The variables consisting of pilot flight hours and 
carrier landings are considered to be measures of pilot 


Currency and proficiency by many in the Navy and are 
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therefore included. 

Aircraft tours is included as a measure of the general 
condition of the aircraft and as an indication of aircraft 
age. Each aircraft in the Navy's inventory undergoes a 
Periodic Aircraft Rework (PAR) for analysis, cepair and 
conversion at intervals unique to the model aircraft after a 
specific number of flight hours. This variable also serves 
to monitor any reliability anomalies other than 
"new-better-than-used" as mentioned by Butterworth, et.al. 
(1974) . 

Aircraft hours is included as a measure of aircraft 
condition and usage since major inspection, primarily the 


calendar inspections. 


D. DATA PREPARATION 


The kasic assumptions for multiple regression analysis 
require that data be measured in at least interval or ratio 
scale and that the relationship among the variables be 
linear and additive. 

All data points used were ajudged to be measured on an 
interval scale. Raw data for each type or group of aircraft 
was averaged by months for each of the thirty-six nonths 
included in the data set where there was an accident for 


that type aircraft. 


E. THE ANALYSIS TECHNIQUE 


The analysis procedure employed was Multiple Regression 
using Forward(stepwise) Inclusion. The Statistical Package 
for the Social Sciences(SPSS) compiled and edited by Nie, 
et.al. (1955) includes a forward stepwise multiple regression 
computer program package developed by Jae-On Kim and Frank 


J. Kohout at the University of Iowa. 
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This package was selected as the means of conducting the 
Statistical analysis of the data sets. 

Kim and Kohout state that forward stepwise multiple 
regressicn is a recognized technique: "(1) to find the best 
linear prediction equation and evaluate its prediction 
accuracy; (2) to control for other confounding factors in 
order to evaluate the contribution of a specific variable or 
set of variables; and (3) to find structural relations and 
provide explanations for seemingly complex multivariate 
relationships, such as is done in path analysis." 

The computer program provides the user with various 
options for treatment of data sets, calculaticn of 
statistics and output formats. The procedure, Listwise 
Deletion of Missing Data, is the default option and the most 
conservative in that it maintains sample size and is the 
most accurate. Since the data base finally arrived at was 
complete, no data was deleted by the computer program 
option. If the data base were missing quite a few 
individual data points this procedure could result in a 
drastic decrease in the sample size. This fact was one of 
the underlying considerations in the data base selection 
criteria. 

The options not selected could, if not used with 
considerable prudence and judgement, result in the 
introduction of large amounts of bias that would be very 
difficult to detect without a very good feel for expected 
experimental results. It is for this reason that the 
Listwise Deletion of Missing Data option was used for all 
computer runs in the current study. Kim and Kohout state 
that: 

"There are many occasions for which simple linear 
SEER tp de Gxpected (on tae basis of ‘t eory) to. take 
is siaply upknowa, and tie canination of the scatter piots 
Suggests clear deviation from linearity or, at least, the 
pee (Sy Efe cogpiuca°eiuccs Of the indepenient “var lables 
Mee Mites cituations are (1) to transfora the original 


variables in such a way that the resultant relationShips 
among the transformed variables become linear, (2) to find 
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a Simple nonlinear form through the use of polynomial 
regression, and (3) to introduce interaction terms as new 
Variables." 


There are two extreme viewpoints in regression 
analysis, with valid arguments to support both cases. 
Draper and Smith (1966) explain that the two opposing 
viewpoints are "(1) to make the prediction equation valid 
you should include as many predictor variables as possible; 
and (2) because of increased cost of obtaining variables and 
monitoring them, the equations should include as few 
variables as possible." 


The process of selecting the best regression equation 
1s the process of compromising between these two extreme 
viewpoints. There is no unigue statistical procedure for 
choosing the 'best' equation and large amounts of personal 
judgement are required. In this regard the techniques of 


regression analysis become an art as well as a science. 


Pnitial analysis of data from the current study 
displayed indications of nonlinearity and interactive 


effects between independent variables. 


To deal with these effects no single set of regression 
Variables were deemed ‘best! but rather a series of seven 
different regression variable packages were constructed. 


Each data set was run with all seven different packages. 


Variables eight and nine from Table 2 were deemed 
aircraft oriented variables while the remaining basic 


Variables were considered human oriented measures. 


Regression I consisted of only those basic variables 
that were human related. Regression II contained all the 
basic variables pilot-or aircraft oriented. Regression III 


used the pilot oriented basic variables plus transformations 
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consisting of the square and the square root of each basic 
variable used. Regression IV includes all of Regression III 
plus the square of, the square root of and the two aircraft 
related basic variables. Regression V contains all of 
Regression III plus the twenty-eight possible cross-products 
of the eight basic variables. Regression VI contains all of 
Regression V plus the aircraft related basic variables,. 
their squares and square roots and the cross product of 
Aircraft Tours and Aircraft Hours. Regression VII contains 
the ten basic independent variables, their squares, square 


roots and the forty-five possible cross-products. 


The decision to employ squares and square note was 
made to provide a larger number of variables capable of 
accounting fOr Ccurvilinearty. The introduction of 
cross-products allows for interactive effects of independent 
Variables. The use of the Forward (stepwise) Inclusion 
Multiple Regression computer program facilitates the 
creation and inclusion of many various transforms. The 
packages used in the study were considered the most 
versatile of the trial packages used in preliminary studies 
by the author. 


FP. DECISION CRITERIA 


The forward stepwise multiple regression program 
contains preselectable stopping criteria that were adjusted 
to facilitate introduction of variables into the equation 
that by themselves made a significant contribution to 
explaining the variance in the dependent variable RATE. As 
a ‘rule-of-thumb! in predictive equation selection the study 
attempts to restrict the number of variables in each 
equation to five. This decision is based upon the degrees 


of freedor in the regression equation and the need to 
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Maintain a Significant ratio to provide a solid statistical 
base for conclusions. 


With the degree of freedom restrictions attained the 
primary decision criteria are: 


1) The eguation with a Significance level of 
100 (1-alfa) percent greater than or equal to ninty-five 
percent, and 


2) That equation that accounted for the largest amount 


of variance in the dependent variable. 


In those cases where the choice of the ‘'best' equation 
was not clearly indicated other more subjective measures 
were employed, such as, examination of scatterplots of the 
Standardized residual versus the standardized predicted 
dependent variable, the plot of the standardized residuals, 
and consideration of the intuitive impact of the particular 


variables in the eguations under consideration. 


FOr example, all other decision criteria being 
statistically equal the equation containing CLDAY - (CLDAY) 2 
+ (RTCLDAY) would be selected over the equation containing 
(AGE) (ACTOURS) + (TTIME) (ACHRS). 


Since many of the regression packages were very similar 
some cases cculd yield the same equations for more than one 
regression package, while other cases could yield no 
Significant equation for any regression package. In either 
case for ccmpleteness the 'best' equations is indicated in 
the results even if that equation is not statistically 


Significant. 


This chapter has described the analysis procedures used 


in the development of predictive equations for variance in 
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accident rates by month. The next chapter contains the 
results of the analysis by aircraft community and aircraft 


type where possible. 
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IV. RESULTS 


The results by aircraft type or aircraft community are 
contained in this chapter. The best predictive equation 


provided by the seven Regression Packages are shown. 


A. ATTACK AIRCRAFT 


The aircraft of the attack community were divided into a 
composite regression and three separate regressions. The 
composite consisted of accident involved aircraft of the 
types A-3, A-4&, A-5, A~6, and A-7. All variants of each 
type aircraft were included (for example, KA-3, EA~3 and 
A~3). The three separate regressions were conducted on A-4, 


A-6, and A-7 respectively. 


1. Attack Composite 


This category was used to investigate trends unique 
to the Attack community and not peculiar to a specific 
attack type aircraft. Additionally, the relatively small 
size of the A-3 and A-5 communities precluded an independent 
analysis of these aircraft. Rather than omit A-3's and 
A-5's they were included here. The accident involved 
aircraft in this category provided the maximum sample size 
of thirty-six data points for analysis. As the number of 
aircraft included of each type are not equal the category 
may be biased towards the larger A-4 and A-7 communities, 


however, the author felt that this would in no way endanger 
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the results as the Attack community is being considered as a 
community. While data was available for A-1's they were not 
included for two reasons, firstly, it was desired to limit 
the category to jet aircraft and secondly, the A-1 has been 
phased out of the active aircraft inventory. 


The basic variables in the regression accounted for 
less than twenty percent of the variance in accidents rate, 
however, the regressions consisting of transformed variables 
yielded two equations of approximately equal quality which 
ares 


A. Rate (ATTACK) = Q0.98593 - 0.01221 (ACTOUR)2 + 
0.00232 (ACHRS) - 0.00226 (CLDAY)2 + 0.62245 (RTCLNITE) - 
0.00193 (NITE9O) 2 

and, 

Bes Rate (ATTACK) = Q.79467 - Q.00268 (CLDAY)2 + 


0.68955 (RTCLNITE) - 0.00664 (NITE90)2 + 0.15186 (NITE9O) 
0.09602 (RTTOT9O). 


Equation A and equation B are both significant at 
the 99 percent level and equation 8B accounts for 59.74 
percent of variance in rate contrasted to 51.11 percent for 
equation A. The author's deciSion criteria were met Dy both 
equations, however, examination of residual plots’ favored 
eguation A by a narrow margin. The Forward (stepwise) 
Regression criteria selects variables for inclusion in the 
predictive eguation by adding the variable that accounts for 
the largest increase in the percent of variance in the 
dependent variable. It is of interest to note that the 
aircraft oriented variables entered the predictive equation 
first in Equation A followed by the pilot oriented 
Variables. With the deletion of aircraft variables the 


second equation provided by only pilot oriented variables 
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accounted for an additional 8.63 percent of variance in rate 
but with a lesser initial effect of the first two variables 
added. 


It can be observed that equation A contains two 
aircraft related variables while equation B is composed 
entirely of pilot oriented variables of which three are 
included in both equations. The predictive equation  (B) 
contains the variable NITE90 in two functional forms. The 
net effect on the dependent variable rate is positive for 
values of NITE9O0 less than or equal to 22.87 hours. For 
hours greater than 22.87 the effect is to reduce the 
predicted monthly accident rate. 
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This category contains all accident involved A-4 and 
TA~4 aircraft in the three year period studied and provides 
a sample size of thirty-one cases. The regression of the 
basic variables accounted for only 22 percent of variance in 
rate at a Significance level of 75 percent. The predictive 


equation considered 'best' was: 


Rate (A-4) = -0.00804 (DNA)2 + 0.10473 (RTACHRS) 
-0.00010 (DAY90)2 + 0.77584 (RTNITE90) - 0.11160 (NITE9O) - 
0.13246. 


This equation, Significant at the 95 percent level, 
accounts for 42.68 percent of the variance in rate. 


It is noted that four of the five variables are 
pilot oriented variables three of which are based on hours 
flown. The predictive eguation for A-4 aircraft contains 
NITE9O in two functional forms as did the predictive 


equation for the Attack community. Again as in the Attack 
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community the net effect of NITE90 is positive for the lower 
number of hours flown. Particularly the net effect is 
positive for NITE90 less than or equal to forty-eight hours 
and negative for values greater than forty-eight. 





This category was restricted to a sample size of 
twenty due to relatively few accidents and a smaller 
community. In order to achieve the largest sample size 
possible the author included EA-6 aircraft with the A~6 and 
KA-6 models. The smali sample size tends to make suspect 


any results derived by regression analysis. 


The basic variables accounted for less than ten 
percent of the variance in rate while the ‘'best' predictive 
equation accounted for 40.27 percent. This equation is 
however significant only at the 75 percent level. The 
author feels that the small sample size tends to negate any 
usefulness of this regression. The equation is included for 
continunity cf the study and for discussion purposes. The 


equation is: 


Rate (A-6) = 16.28967 - 0.04604 (DNA)2 + 2.33592 
(RTDNA) - 20.30561 (RTACTOUR) + 5.34649 (ACTOUR) + 0.05874 
(RTTIME) . 


It is noted here that only three basic variables are 
used in scme functional form with a balance of aircraft and 
pilot oriented variables used. The independent variables 
DNA and ACTOUR are each used in two functional forms. The 
net effect of ACTOURS is positive for values greater than or 
equal to 15 tours while the net effect of DNA on the 
dependent variable rate is positive for values less than or 


equal to 13.70 and becomes negative for values greater than 
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13.7 years. 
4. Aw7d Aircraft 


This category provides a sample size of thirty-two 
cases based on all A-7 aircraft models involved in accidents 
during the study period. The regression analysis yielded 


the following predictive equation: 


Rate (A-7) = 0.27170 (RTCLDAY) - 0.01856 (CLNITE)2 + 
0.21346 (RENITE90) + 4.01164 (RTDNA) - 0.88896 (DNA) - 
3.70545. 


This equation, Significant at the 99 percent level 
accounts for 55.01 percent of the variance in rate. 


The predictive equation for A-7 aircraft also 
contains two functional forms of DNA. Again here as in the 
A-6 the net effect is negative for large values of DNA, 
greater than 21 years inthis case. While the effect on 
rate 1S positive for values less than 21 years the net 
effect decreases as the value of DNA approachs 21 years. 
This agrees with the intuitive feeling that DNA is a measure 
of pilot proficiency and the more experience the safer the 


plot. 
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TABLE 3 
ATTACK AIRCRAFT VARIABLE SUMMARY 


VARIABLE ATTACK-A ATTACK-B A-4 A-6 A-7 FREQ 
NITE9O 0.07382 0.12366 2 
ACTOUR -0.17445 1 
ACHRS 0.31152 1 
DNA OAS ace) a 
DAY902 -0.12592 1 
NITE902 -0.04184 -0.04184 Via 
CLDAY2 -0.31728 -0.31728 V1 
CLNITE2 0.23345 1 
ACTOUR2 -0.37244 1 
DNA2 -0.31353 -0.34703 2 
RTTIME -0.03596 1 
RTTOTIO 0.06426 1 
RTNITEIO Oreo ta 0.22398 2 
RTCLDAY 0.34684 1 
RTCLNITE 0.13914 0.13914 1/1 
RTACTOUR -0.14537 1 
RTACHRS 0.30860 1 
RTDNA -0.24288 0.22979 2 

H 


TABLE ENTRIES ARE THE CORRELATION COEFFICI£NT OF T 
DISPLAYED VARIABLE WITH RATE 


5 


Table 3 displays the basic and transformed variables 
as used in the regression package. The suffix '2' indicates 
that variable squared and the prefix 'RT' the square root of 
the variable. Tabled are the correlation coefficients of 
the displayed variables with the dependent variable rate. 

For those cases where the variable appears in nore 


than one equation the correlation coefficients are quite 
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consistant with the exception of RTDNA where the coefficient 
for A~7 is contrary to what would be normally considered 
true. While it is generally believed, the remaining 
aircraft types bear out, that the adage that the older, more 
experienced pilot has fewer accidents, this does not appear 
to hoid for the A-7 aircraft. It is noted that the values 
for DNA and RTDNA in Table 3 for A-7 are positively 
correlated, in addition the coefficient for pilot age is 
also positively correlated for A-7. The author is unable 


from his experience to explain this unusual occurance. 


TABLE 4 
ATTACK AIRCRAFT BASIC VARIABLE SUMMARY 


VARIABLE ATTACK~-A ATTACK-B A-4 Azo A= FREQ 


AGE 0/0 
TTIME 1 1/1 
TOTIO 1 0/1 
DAY90 1 V/1 
NITE9O 1 2 2 1 4/5 
CLDAY 1 1 2/2 
CLNITE 1 1 1 2/2 
DNA 1 2 Ss 
ACTOUR 1 2 3/2 
ACHRS 1 1 2/1 


Table 4 relates the usage of basic variables in each 
category in some functional forn. Basic variables AGE, 
TTIME, TOT90, and DAY90 are used one time or less indicating 
that these measures have little or no effect on predicting 
variance in aircraft accident rates. NITE90 and DNA are the 
high usage variables followed by CLDAY, CLNITE, ACTOUR and 
ACHRS with moderate usage. 
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B. FIGHTER AIRCRAFT 


The analysis of fighter aircraft was restricted to F-4 
and a composite of F-4 and F-8 aircraft. The data base did 
not provide enough data to conduct independent analysis of 
F-8's by themselves which led to the composite category. 


1. Fighter Composite 


This category was included to provide a method of 
including F-8 aircraft and to facilitate the possible 
contrast of the Attack and Pighter communities. 

The composite analysis yeilded a sample size of 
thirty-six cases primarily on the strength of the F-4 
community. The basic variable regression accounted for less 
than twenty-five percent of the variance in rate at a 
Significance level of 95 percent. Once again the regression 
using transformed variables provided a better predictive 


eguation as shown below. 


Rate (Fighter) = 1.21906(RTACTOUR) + 0.23768 
(RTDAY90) + 0.01897 (CLDAY)2 + 2.38695 (RTCLDAY) -0.92126 
(CLDAY) - 2.48215. 


This equation accounts for 40.45 percent of the 
Variance in rate and is significant at the 99 percent level. 

It is observed that the variable CLDAY appears in 
each of its functional forms and while it does not account 
for the most variance initially in conjunction with the 
forms of ACTOUR and DAY90O it adds about sixteen percent to 
the accounting of variance in rate. The net effect on the 
dependent variable rate of the variable CLDAY is positive 
for values less than or equal to 11.56 daytime carrier 


landings in thirty days. For values greater than 11.56 the 
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net effect becomes negative and will tend to decrease the 
accident rate. 


2- F-4 Aircraf 





The category of F-4 aircraft consisted of a sample 
size of thirty-six data points. Regression analysis yielded 
the following predictive equation: 


Rate (F-4) = 0.19142 (RTDAYIO) - 0.03663 (CLNITE)2 - 
GO2000002 (ITIME)©= + 0.17982 (RITIME) ~-0.01302 (DNA)© - 
1259073. 


This equation accounts for 34.79 percent of variance 
of rate at significance level 95 percent. The equation 
generated by the basic variables alone accounted for less 
than nine percent and were not significant at the 75 percent 
level. 

The predictive equation deemed ‘best! was generated 
from Regression III and contained only  pilot-oriented 
variables. 

The variable TTIME appears here in two functional 
forms with a positive net effect on rate for values less 


than or equal to 2006 hours. 
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TABLE 5 


FIGHTER AIRCRAFT VARIABLE SUMMARY 


VARIABLE FIGHTER r=4 FREQ 
CLDAY -0. 16666 2 
TTIME2 ~Q.11285 1 
CLDAY2 -0.26346 1 
CLAITE2 ~0.19546 1 
DNA2 —Oello2s 7 1 
RTTIME 0.08688 1 
RTDAY9O 0.24865 0.23319 2 
RTCLDAY 0.00643 1 
RTACTOUR 0.44776 1 


TABLE ENTRIES ARE THE CORRELATION COEFFICIENTS OF THE 
DISPLAYED VARIABLE WITH RATE 


In the case RTDAY90 which appears in both equations 
the correlation coefficients are quite consistent. It is 
noted that the variables normally considered as pilot 
proficiency variables are negatively correlated with rate, 


except in the cases where square roots are used. 
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TABLE 6 


FIGHTER AIRCRAFT BASIC VARIABLE SUMMARY 


VARIABLE FIGHTER P-4 FREQ 
AGE 0 
TTIME 2 2 
TOTI9O 0 
DAY90 1 1 2 
NITESIO 0 
CLDAY 3 3 
CLNITE 1 1 
DNA | 1 1 
ACTOUR 1 1 
ACHRS 0 


Table 6 relates the usage of basic variables in some 
functional form by category. The basic variables CLDAY, 
TTIME, and DAY90 are the high usage variables in this 
category and are all pilot related variables. The variables 
AGE, TOT90, NITE9O and ACHRS did not appear in any form in 
the fighter community 


C. PROPELLER AIRCRAFT 


The aircraft considered in the propeller aircraft 
Capegoryeconscisted Of E~1, E=2, C-~1, C-2, S-2, P-~3, C117, 
C-118, and C-130. Due to the relatively small size of each 
individual ccmmunity and the infrequency of accidents it was 
necessary to combine all aircraft into one category entitled 
'PROPS'. This procedure 1s somewhat unnerving as there are 


normally aspirated and turboprop aircraft together as well 
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as carrier-based and land-based aircraft. This tends to 
bias the results and applications or inferences cannot be 
directed toward any particular member aircraft in the group. 

The result of aggregating the above aircraft is a sample 
size of twenty-six cases which provides an equation of basic 
variables that accounts for less than thirty percent of the 
variance in rate at a significance level of seventy-five 
percent. 

Regression packages V, VI and VII provided the same 
equation which was judged ‘best' by the author. 


Rate(PROPS) = 0.35935 + 0.00002[ (TTIME) (NITE90] - 
0.00022 (NITE90)2 - 0.00001 [ (AGE) (TTIME)] + 0.00108 
[ (CLDAY) (LNA) ] -0.00595 (CLNITE) 2. : 


The predictive equation accounts for 43.25 percent of 
the varlance in rate at a significance level of 95 percent. 
This category provides the only case where cross-products 
contribute to the predictive equation. The eg uation 
consists of only six basic variables in some functional 
form, and all six are pilot oriented variables. This could 
be indicative of the inherent safety of large multi-engined 
propeller aircraft where a flight may be aborted due to a 
mechanical failure with a lower probability of an accident 
resulting frem the mechanical failure. 

The inclusion of pilot oriented variables in the area of 
Carrier landings casts doubt upon the validity of the 
predictive equation because many of the aircraft are not 
carrier-based and their pilots do not record Carrier 
landings. The equation is still considered valid by the 
author in that a value of zero was recorded for those pilots 
with no carrier landings and if the regression technique 
still selects that variable it is due to the correlation 
interactions with that variable and the combined independent 


Variables included in the equation with the dependent 
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Variable rate. 


TABLE 7 


PROPELLER AIRCRAFT VARIABLE SUMMARY 


VARIABLE PROPS 

NITE9O2 0.09764 
CLNITE2 ae OUD 
CLDDNA 0.21845 
DINGEE 0.55415 
AGEDNA 0.19878 


TABLE ENTRIES ARE THE CORRELATION COEFFICIENT OF THE 
DISPLAYED VARIABLE WITH RATE 


Papi eS 


PROPELLER AIRCRAFT BASIC VARIABLE SUMMARY 


VARIABLE PROPELLERS PREQUENCY 
AGE 1 1 
Tae ME z Z 
Re IO 0 0 
DAY9O 0 0 
NITESO 2 z 
CLDAY 1 1 
CLNITE 1 1 
DNA 1 1 
ACTOUR 0 0 
ACHRS 0 0 
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Table 7 shows that only one variable, CLNITE2, is 
negatively correlated with rate, while the remaining 
Variable forms are all positively correlated. Table 8 
reflects the degree of usage of each basic variable with 
total time and night hours being used twice and the 
remaining basic varaibles one time or not at all. 


D. HELICCPTERS 


The category helicopters consists of the aggregate of 
H-1, H-2, H-3, H~46 and H-53. This category. like that of 
Propellers did not provide sufficient data to conduct 
independent analysis by type aircraft. The aggregate 
yielded a sample size of thirty-three cases. 

The analysis yielded only two predictive equations for 
the seven regression packages. The equation provided by the 
basic variables was ajudged ‘best’ and is: 


Rate (HELO) = 0.00062 (TTIME) + 0.65405. 


This equation, significant at the ninety percent level 
accounts for 9.30 percent of the variance in rate. The 
second equation provided by the remaining five regressions 
accounted for 11.45 percent of the variance in rate. The 
difference cf 2.15 percent was not deemed sufficient 
increase to use the second equation which consisted of the 


variable TTIME squared. 
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V. DISCUSSION 


It is interesting to note that even though the variable 
AGE was the most significient single variable in the overall 
equation arrived at in the study by Stucki and Maxwell that 
AGE appeared only once in the current study. The variable 
AGE appeared as a cross product with TTIME in the prediction 
equation for propeller aircraft. The current study employs 
the variable DNA which was not used by Stucki and Maxwell. 
The variable DNA was used in some functional form seven 
times and represents the highest single useage of any basic 
variable. The simple correlation between AGE and DNA is 
quite high in each of the eight categorys. Intuitively this 
ilmplys that one or the other will dominate and both will try 
to account for the same portion of variance in rate 
explainable by this type of variable. A similar trend 
appears in looking at the usage of TOT90, DAY90 and NITE9O. 
As the sum of DAY90O and NITE90 equals TOT90O0 it would be 
expected that one of the variables would dominate the 
predictions. This is in fact the case as NITEIO is used six 
times, DAY90 three times and TOT90 enters only in the 
alternate best equation for Attack aircraft. The variables 
TTIME and CLDAY are each used six times in some functional 
form while CLNITE appears four times. The aircraft oriented 
Variables appear a total of six times in some functional 
form, four times for ACTOUR and twice for ACHRS. 

Although some of the Regression Packages yielded the 
same equation within categories, Regression Package IV _ can 
be credited with the best equation in five out of eight 
categories. Regression IV provided the best equation in the 
four attack aircraft categories and in the category Fighter 
apperaft. Regression Package V provided the Propeller 


equation, Regression et the Helicopter equation and 
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Regression III the F-4 equation. The predictive equation 
for P-4 is-the only case where the best equation was 
provided by a regression package that contained only pilot 
oriented variables. The category consisting of Helicopters 
was the only case where the transformed variables did not 
provide a large improvement over pure basic variables in the 
predictive equation generated. 

The majority of the variables entered in the eight 
predictive equations were of the variable squared or the 
Square root of the variable, thirteen and fourteen times 
respectively. Three cross products used six variables and 
the basic variables appeared six times. 

There does not appear to be any trend or tendency for 
any particular basic variable to be consistent over the 
range of the eight categories considered. If the hypothesis 
that the older more experienced pilot is a safer pilot is 
valid and if the variables of AGE and DNA can be considered 
as measures of this hypothesis, then the author would expect 
the simple correlation coefficients of these variables to be 
negative. This is not the case as five of the sixteen 
coefficients are positive. It 1s possible that due to the 
relatively small size of each sample and the fact that there 
are months where only one accident occured that a 
coefficient could be only slightly positive without 
violating the hypothesis. This does not explain the 
coefficients of the A-7 category (see Table 9) where the 
coefficient of AGE is on the order of 0.26 and DNA is 0.15. 
As stated previously the author is unable to explain this 
phenomena. Similar arguments can be generated for each of 
the ten basic variables. The closest case to being 
consistent in sign is with CLNITE where all coefficients are 
negative except that of A-6 which is 0.07. The value of 
0.07 for A-6 combined with the extremely small sample 
considered in this case (20 data points) leads the author to 


discount any significance in sign for this variable. 
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TABLE 9 


ATTACK A-4 A=6 AS 
AGE =0227305 -0.12430 -0.30084 0.26525 
TTIME -0.05110 -0.08715 -0.09278 0.17624 
TOT9O 0.02845 -0.01864 -0.02922 -0. 03100 
DAY9O 0.01701 -0.08675 0.02350 -0.07358 
NITEIO 0.07302 0.12366 =O oo 15 0.07386 
CLDAY =0 . W953 0.05906 -0.02135 0.21725 
CLUNITE -0.11847 -~0.01026 0.07686 -0.03701 
ACTOUR =VeooUS7 70.11151 -0.17445 -0.16477 
ACERS 0.31152 0529628 0.10899 -0.11530 
DNA -0.18404 -0.25457 -0.28943 0.15431 

FIGHTER F~-4 PROPS HELOS 
AGE -0.006448 0.00785 -0.06703 0.09091 
TIIME 0.15639 0.01361 0.21211 0.30509 
TOTSO 0.11077 0.14855 0.08224 0.09746 
DAYIO 0.23847 0.20732 -0.00069 0.12454 
NITE9O -0.22743 -0. 14624 0.16823 -0.000 36 
CLDAY -0. 16666 0.10926 0.20479 % 
CLNITE -0.16156 -0.12479 -0.22141 * * 
ACTOUR 0.42709 0.19403 0.14963 -0.07866 
ACHRS 0.03460 0.20765 0.12285 0.08244 
DNA =O:SO 256 =O l232 =o", 05969 0.04829 


** no carrier landings recorded 
TABLED VALUES ARE THE CORRELATION COEFFICIENTS OF THE BASIC 
VARIABLES WITH ACCIDENT RATE BY CATEGORY 


The net effect on Rate by the combined functional forms 
of a basic variable was discussed in the results by 
category. It should be noted, however, that adjusting the 
values of a baSic variable to bring about a decrease in the 
accident rate could potentially cause one of the other 
measures to shift in such a way that the total net effect 


would be to increase the accident rate. 
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VI. RECOMMENDATIONS 


ne? ee Gee 6S a 


\ 

The current study consisted of analysis of aircraft 
accident rate by type aircraft. In this approach the size 
of the communities and the resulting size of the data base 
are such that the analysis is constrained by the degrees of 
freedom available in regression techniques. As a future 
study the techniques and hypotheses employed in this study 
would be a useful starting point. The necessity of a larger 
data base would be overcome by time as the inventory studied 
1s not that different from the current Navy inventory. The 
sensitivity of some of the basic variables employed in this 
study suggest that future studies procure additional data of 
the following types: 

1) In addition to the number of day and night carrier 
landings in the past thirty days, a numerical grade of the 
quality of each landing made should be included. 

2) A breakdown of the hours flown in the preceeding 
ninety days to include, for example, flight hours in past 24 
hours, flight hours in past 72 hours. This would allow 
inclusion of concepts of fatigue versus proficiency. 

3) In addition to AGE and DNA, the number of months in 
operational flying billets and the number of months in 
CuwEEent tour. 

The data base employed contained much information on 
accidents and allows constructing.a profile of the pilot who 
had an accident. The single most severe hinderance to this 
author in drawing conclusions was the lack of adequate or 
equal kncwledge of the pilot who did not have an accident. 
It is recommended that prior to any future studies of this 
type the analyst procure data on accident free pilots with 
aS Many variables in common with the accident involved pilot 


as feasible. The hinderance to this author was that a 
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profile of the accident involved pilot may be identical to a 
profile of the non-accident involved pilot. Without the 
results of this comparison the usefulness of the predictive 

equations for prediction cannot be demonstrated 
. statistically. It iS possible however that the equations 
could be validated by using them to attempt to predict and 
comparing the actual resulting rates. 

The real benefits of this study are in the analysis of 
the variables that enter the equations and by using the 
frequency of appearance in planning future studies with even 
greater detail in those areas where the variables appear to 
contribute the most. 

While this study is somewhat broad in scope it does 
provide encouragement for future efforts along this line of 
reasoning. The ever increasing necessity to reduce loss in 
human life and dollars due to -aircraft accidents provides 
the incentive. 
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APPENDIX A 


AVERAGE MONTHLY DATA POINT VALUES 


This appendix contains the average monthly data point 


Values for the baSic variables and the dependent 
Rate. Each aircraft type or community examined in 
1s recorded in a table. 

The dependent variable Rate is the aircraft 
rate per ten thousand hours. Rate is calculated 
the number of aircraft accidents for the month 


variable, 
the study 


accident 


by taking 


times ten 


thousand hours and dividing by the total number of aours 


flown by that type aircraft for the month. 
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A-6 AIRCRAFT REGRESSION ANALYSIS DATA BASE FOR FISCAL YEARS JULY 1972-JUNE1974 
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F-4 AIRCRAFT REGRESSION ANALYSIS DATA BASE FOR FISCAL YEARS JULY 1972~JUNE 1974 
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